3574 results found.
Speech
Corpus,
Language Type:
Multilingual
Languages:
Dari/Pashto Dutch English Finnish French Hindi Icelandic Indonesian Japanese Lithuanian Malay Mandarin Nepali Portuguese Punjabi Romanian Slovenian Spanish
Availability:
From Owner
License:
CreativeCommons
Size:
467 hours Production Status:
Newly created-finished
Use:
Person Identification
-
Paper title:JukeBox: A Multilingual Singer Recognition Dataset
-
Paper track:4.3 Speaker verification and identification/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Anurag Chowdhury | JukeBox | /N |
Documentation:
Documentation in English language will be made available upon publication of the dataset.
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Attribution 4.0 International (CC BY 4.0)
Size:
12 GByte Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Sparse Mixture of Local Experts for Efficient Speech Enhancement
-
Paper track:6.4 Speech enhancement: single-channel/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Aswin Sivaraman | MUSAN | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Attribution 4.0 International (CC BY 4.0)
Size:
60 GByte Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Sparse Mixture of Local Experts for Efficient Speech Enhancement
-
Paper track:6.4 Speech enhancement: single-channel/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Aswin Sivaraman | LibriSpeech ASR corpus | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
LDC User Agreement for Non-Members
Size:
600 MByte Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Unsupervised Methods for Evaluating Speech Representations
-
Paper track:5.2 Speech analysis and representation/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Michael Gump | TIMIT Acoustic-Phonetic Continuous Speech Corpus | /N |
Documentation:
https://catalog.ldc.upenn.edu/LDC93S1
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Attribution-ShareAlike (CC BY-SA) license
Size:
105 GByte Production Status:
Existing-used
Use:
Spoken captions for images
-
Paper title:Speech-Image Semantic Alignment Does Not Depend on Any Prior Classification Tasks
-
Paper track:10.1 Multimodal systems/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Masood Mortazavi | The Places 205 Audio Caption Corpus (PlacesAudio400k) | /N |
Documentation:
README distributed with the dataset . . .
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
None Production Status:
Existing-used
Use:
Machine Learning
-
Paper title:Analysis of Disfluency in Children's Speech
-
Paper track:12.10 Metadata for ling./discourse structure (disf/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Trang Tran | SwitchBoard | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
None Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Analysis of Disfluency in Children's Speech
-
Paper track:12.10 Metadata for ling./discourse structure (disf/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Trang Tran | CALLHOME American English Transcripts | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
637.7 hours Production Status:
Newly created-finished
Use:
Machine Learning
-
Paper title:Speech Recognition and Multi-Speaker Diarization of Long Conversations
-
Paper track:10.5 Rich transcription/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Huanru Henry Mao | This American Life Podcasts | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Not Available
License:
Size:
20 hours Production Status:
Newly created-finished
Use:
Mental health assessment
-
Paper title:Learning to Detect Bipolar Disorder and Borderline Personality Disorder with Language and Speech in Non-Clinical Interviews
-
Paper track:12.6 Speech and multimodal resources/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Bo Wang | Automated Monitoring of Symptoms Severity Interviews(AMoSS-I) | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Attribution-ShareAlike (CC BY-SA)
Size:
4.2 GByte Production Status:
Existing-used
Use:
Speech Synthesis
-
Paper title:Voice Conversion using Speech-to-Speech Neuro-Style Transfer
-
Paper track:7.4 Speech synthesis paradigms and methods/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ehab AlBadawy | Flickr8k Audio | /N |
Documentation:
None




